Abstract

In this paper, we find an expression for the density of the sum of two independent $d$-dimensional Student t-random vectors $X$ and $Y$ with arbitrary degrees of freedom. As a byproduct we also obtain an expression for the density of the sum $N + X$, where $N$ is normal and $X$ is an independent Student t-vector. In both cases the density is given as an infinite series

$$\sum_{n=0}^{\infty} c_n f_n$$

where $(f_n)$ is a sequence of probability densities on $\mathbb{R}^d$ and $(c_n)$ is a sequence of positive numbers of sum 1, i.e. the distribution of a non-negative integer-valued random variable $C$, which turns out to be infinitely divisible for $d = 1$ and $d = 2$. When $d = 1$ and the degrees of freedom of the Student variables are equal, we recover an old result of Ruben.



1. Introduction 

The Student t-distributions play an important role in statistics for the analysis of normal samples. One of the major properties of these distributions is their heavy-tail behaviour; the same behaviour is shown by the famous family of stable distributions. The Student t-distributions have the advantage of having an explicit expression, which allows one, for example, to write down their likelihood function explicitly; this is not the case for the stable distributions, except in some well-known particular cases.

However, except in the Gaussian and Cauchy cases, the Student t-distributions have the drawback of not being stable under convolution. Thus the study of their behavior under convolution is of high interest. This study was first addressed in [?] and more recently in [?], but concerns only the case where the number of degrees of freedom is an odd integer. The study of Student t-processes has recently given rise to a series of studies [?].

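The $d$-dimensional Student t-density with $2\nu > 0$ degrees of freedom is

$$f_\nu(t) = A_{\nu,d}\left(1 + \|t\|^2\right)^{-\nu-\frac{d}{2}}, \quad t \in \mathbb{R}^d; \qquad A_{\nu,d} = \frac{\Gamma\left(\nu + \frac{d}{2}\right)}{\Gamma(\nu)\,\pi^{d/2}}.$$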

In the following, we provide an expression for the density of the sum

$$Y = T_1 + T_2$$

where $T_1$ and $T_2$ are independent $d$-variate t-distributed random vectors with respective degrees of freedom $2\nu$ and $2\mu$. When $\mu = \nu$ and $d = 1$, this equals an expression found by Ruben, cf. [14, formula 3.4]. As a byproduct, we also obtain an expression for the density of

$$Z = X + T,$$

where $X$ is Gaussian. Moreover, we extend this result to the case of $Z = a_1 X + a_2 T$ with $a_1, a_2 > 0$. In all cases, our expression is an infinite convex combination of a sequence of $d$-variate densities, which we describe next.

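Let us introduce the family of densities

$$g_{k,\sigma}(x) = \frac{\Gamma\left(\frac{d}{2}\right)}{\pi^{d/2}\,\Gamma\left(k+\frac{d}{2}\right)(\sigma\sqrt{2})^{d}} \left(\frac{\|x\|^2}{2\sigma^2}\right)^{k} \exp\left(-\frac{\|x\|^2}{2\sigma^2}\right), \quad x \in \mathbb{R}^d, \tag{1}$$

with $k \in \mathbb{N}$. We remark that $g_{0,\sigma}$ is the $d$-variate Gaussian density with zero mean and covariance matrix $\sigma^2 I_d$; for $k \neq 0$, by [?, Th. 2.9], $g_{k,\sigma}$ is the density of $\sigma\|X_{2k+d}\|U_d$ where $X_{2k+d} \in \mathbb{R}^{2k+d}$ is a Gaussian vector with identity covariance matrix and $U_d$ is uniformly distributed on the unit sphere $S_d$ of $\mathbb{R}^d$ and independent of $X_{2k+d}$.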
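Let us also introduce the family of densities

$$\phi_{n,\eta}(x) = \frac{\Gamma\left(\frac{d}{2}\right)}{\pi^{d/2}\,B\left(\eta, n+\frac{d}{2}\right)} \frac{\|x\|^{2n}}{\left(1+\|x\|^2\right)^{\eta+n+\frac{d}{2}}}, \quad x \in \mathbb{R}^d, \tag{2}$$

where $\eta > 0$. We note that, with the same notation as above, $\phi_{n,\eta}$ is the density of $\|T_{2n+d}\|U_d$, where $T_{2n+d} \in \mathbb{R}^{2n+d}$ follows a multivariate Student t-distribution with $2\eta$ degrees of freedom, with density

$$\frac{\Gamma\left(\eta+n+\frac{d}{2}\right)}{\Gamma(\eta)\,\pi^{n+\frac{d}{2}}} \left(1+\|x\|^2\right)^{-\eta-n-\frac{d}{2}}, \quad x \in \mathbb{R}^{2n+d}.$$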

Finally, we recall that a discrete random variable $B$ follows a negative binomial distribution with parameters $r$ and $p$ (denoted as $NB(r,p)$) if

$$\Pr\{B = k\} = \binom{k+r-1}{k}\, p^{r} (1-p)^{k}, \quad r > 0,\ 0 < p < 1,\ k = 0, 1, \ldots$$

NB distributions with non-integer parameter $r$ are also known as Pólya distributions.
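As a small illustration, the negative binomial probabilities with a real parameter $r$ can be evaluated through log-Gamma functions. The sketch below assumes the parametrization $\Pr\{B=k\} = \binom{k+r-1}{k} p^r (1-p)^k$, which is the one consistent with the mean $r(1-p)/p$ used in the proofs; the function names are illustrative and not part of the paper.

```python
import math

def neg_binomial_pmf(k, r, p):
    """Pr{B = k} = C(k + r - 1, k) * p**r * (1 - p)**k for real r > 0 and 0 < p < 1.

    Assumed parametrization (see lead-in): p is attached to the exponent r.
    """
    if k < 0:
        return 0.0
    # Generalized binomial coefficient Gamma(k + r) / (Gamma(r) * k!) in log form.
    log_coeff = math.lgamma(k + r) - math.lgamma(r) - math.lgamma(k + 1)
    return math.exp(log_coeff + r * math.log(p) + k * math.log1p(-p))

def neg_binomial_mean(r, p):
    """Mean of NB(r, p) under the parametrization above."""
    return r * (1.0 - p) / p
```

With $r = 1/2$ (the case $d = 1$ below), summing `neg_binomial_pmf(k, 0.5, p)` over enough values of `k` should return a value close to 1.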

2. Results 

The distribution of the sum of two independent Student t-vectors

$$Y = T_1 + T_2$$

can be deduced by subordination from the distribution of $Z = X + T$ where $X$ is Gaussian: if $\gamma_\nu$ denotes the inverse Gamma measure with scale parameter $\frac{1}{4}$ and shape parameter $\nu > 0$,

$$d\gamma_\nu(t) = \frac{1}{4^{\nu}\,\Gamma(\nu)}\, t^{-\nu-1} \exp\left(-\frac{1}{4t}\right) dt, \quad t > 0,$$

and if $g_t(x)$ denotes the $d$-variate Gaussian semi-group of normal densities

$$g_t(x) = (4\pi t)^{-d/2} \exp\left(-\frac{\|x\|^2}{4t}\right), \quad t > 0,\ x \in \mathbb{R}^d,$$

then the Student t-density can be obtained as the mixture

$$f_\nu(x) = \int_0^{+\infty} g_t(x)\, d\gamma_\nu(t).$$

Therefore, we begin by finding an expression for the density of $Z$.
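The mixture representation of the Student t-density can be checked numerically. The following sketch assumes that the mixing measure is the inverse Gamma law with shape $\nu$ and scale $1/4$, with density $t^{-\nu-1} e^{-1/(4t)} / (4^{\nu}\Gamma(\nu))$, and that $g_t(x) = (4\pi t)^{-d/2}\exp(-\|x\|^2/(4t))$. It uses a plain midpoint rule after the substitution $t = 1/(4s)$; all function names are illustrative.

```python
import math

def student_density(r, nu, d):
    """f_nu at a point of norm r: A_{nu,d} * (1 + r^2)^(-nu - d/2)."""
    log_a = math.lgamma(nu + d / 2) - math.lgamma(nu) - (d / 2) * math.log(math.pi)
    return math.exp(log_a) * (1.0 + r * r) ** (-nu - d / 2)

def heat_kernel(r, t, d):
    """Gaussian semigroup density g_t at a point of norm r (covariance 2t * I_d)."""
    return (4.0 * math.pi * t) ** (-d / 2) * math.exp(-r * r / (4.0 * t))

def inverse_gamma_density(t, nu):
    """Assumed mixing density: inverse Gamma with shape nu and scale 1/4."""
    log_density = (-nu * math.log(4.0) - math.lgamma(nu)
                   - (nu + 1.0) * math.log(t) - 1.0 / (4.0 * t))
    return math.exp(log_density)

def mixture_density(r, nu, d, n_steps=20000, s_max=60.0):
    """Midpoint rule for the integral of g_t(x) against the mixing density.

    The substitution t = 1/(4s) maps the heavy tail in t to a neighbourhood
    of s = 0, where the integrand is well behaved for nu + d/2 >= 1.
    """
    h = s_max / n_steps
    total = 0.0
    for i in range(n_steps):
        s = (i + 0.5) * h
        t = 1.0 / (4.0 * s)
        jacobian = 1.0 / (4.0 * s * s)  # |dt/ds|
        total += heat_kernel(r, t, d) * inverse_gamma_density(t, nu) * jacobian
    return total * h
```

For moderate values such as `nu=1.5, d=1`, `mixture_density(r, nu, d)` and `student_density(r, nu, d)` should agree to several digits; the truncation `s_max` and step count are ad hoc choices.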
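2.1. The sum of a Gaussian and a Student t-vector

Our first result reads as follows.

Theorem 1. The density of $Z = X + T$ can be written

$$f_Z(z) = \sum_{k=0}^{+\infty} a_k^{(\nu,\gamma)}\, g_{k,\sigma}(z) \tag{3}$$

where the densities $g_{k,\sigma}$ are defined in (1), and the $a_k^{(\nu,\gamma)}$ are positive coefficients that sum to 1, given by

$$a_k^{(\nu,\gamma)} = \frac{\Gamma\left(k+\frac{d}{2}\right)}{\Gamma\left(\frac{d}{2}\right)\Gamma(\nu)}\, \frac{\gamma^{2k}}{k!} \int_0^{+\infty} e^{-b}\, b^{\nu+\frac{d}{2}-1} \left(b+\gamma^2\right)^{-k-\frac{d}{2}} db, \tag{4}$$

and $\gamma$ is defined as

$$\gamma = \frac{1}{\sigma\sqrt{2}}. \tag{5}$$

Proposition 2. The same result as in Theorem 1 holds for $Z = a_1 X + a_2 T$, $a_1 \in \mathbb{R}_+$, $a_2 \in \mathbb{R}_+$, replacing $\sigma$ by $a_1\sigma$ and $\gamma$ by $\dfrac{a_2}{\sigma\sqrt{2}\,a_1}$.

Note that $a_1 X$ and $a_2 T$ have the same distributions as $(-a_1 X)$ and $(-a_2 T)$ respectively, so that we only need to consider the case $a_1 > 0$, $a_2 > 0$ in the above Proposition.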

2.2. The sum of two Student t-vectors

Starting from the distribution (3) we obtain the following

Theorem 3. The distribution of $Y = T_1 + T_2$, where $T_1$ and $T_2$ are independent t-distributed with respective degrees of freedom $2\nu > 0$ and $2\mu > 0$, can be written as the infinite convex combination

$$f_Y(x) = \sum_{n=0}^{+\infty} c_n^{(\nu,\mu)}\, \phi_{n,\nu+\mu}(x), \tag{6}$$

where $\phi_{n,\nu+\mu}$ are the probability densities given by (2) and $c_n^{(\nu,\mu)}$ are positive coefficients that sum to 1 defined by

$$c_n^{(\nu,\mu)} = \frac{1}{B(\nu,\mu)}\, \frac{\left(\frac{d}{2}\right)_n}{n!} \int_0^1 t^{\mu+\frac{d}{2}-1} (1-t)^{\nu+\frac{d}{2}-1} \left(1-t+t^2\right)^n dt, \quad n \ge 0. \tag{7}$$

Note that (7) shows that $c_n^{(\nu,\mu)} = c_n^{(\mu,\nu)}$, as is to be expected.
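Remark 1. The Fourier transform of the Student density in the case $d = 1$ is

$$\int_{-\infty}^{+\infty} e^{ixy} f_\nu(x)\,dx = k_\nu(|y|) \tag{8}$$

with

$$k_\nu(u) = \frac{2^{1-\nu}}{\Gamma(\nu)}\, u^{\nu} K_\nu(u), \quad u > 0,$$

where $K_\nu$ is the Macdonald function. Writing $D = d/dy$ and $\hat{\phi}$ for the Fourier transform of $\phi$, we claim that

$$D^{2n} k_{\mu+\nu+n}(y) = (-1)^n \frac{\left(\frac{1}{2}\right)_n}{(\mu+\nu)_n}\, \hat{\phi}_{n,\mu+\nu}(y), \quad y \in \mathbb{R}. \tag{9}$$

In fact, from (8) we get

$$D^{2n} k_{\mu+\nu+n}(y) = (-1)^n \int_{-\infty}^{+\infty} e^{ixy} A_{\mu+\nu+n,1}\, \frac{x^{2n}}{(1+x^2)^{\mu+\nu+n+\frac{1}{2}}}\, dx$$

which is (9) because

$$A_{\mu+\nu+n,1}\, B\left(\mu+\nu,\, n+\tfrac{1}{2}\right) = \frac{\left(\frac{1}{2}\right)_n}{(\mu+\nu)_n}.$$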

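Theorem 3 can be expressed for $d = 1$ as

$$k_\nu(|y|)\, k_\mu(|y|) = \sum_{n=0}^{+\infty} c_n^{(\nu,\mu)}\, \hat{\phi}_{n,\nu+\mu}(y)$$

hence by (9)

$$k_\nu(u)\, k_\mu(u) = \sum_{n=0}^{+\infty} (-1)^n c_n^{(\nu,\mu)}\, \frac{(\mu+\nu)_n}{\left(\frac{1}{2}\right)_n}\, D^{2n} k_{\mu+\nu+n}(u)$$

and finally by (7), for $u > 0$,

$$k_\nu(u)\, k_\mu(u) = \sum_{n=0}^{+\infty} (-1)^n \frac{(\mu+\nu)_n}{n!}\, \beta_n^{(\nu,\mu)}\, D^{2n} k_{\mu+\nu+n}(u)$$

with

$$\beta_n^{(\nu,\mu)} = \frac{1}{B(\nu,\mu)} \int_0^1 t^{\mu-\frac{1}{2}} (1-t)^{\nu-\frac{1}{2}} \left(1-t+t^2\right)^n dt.$$

In terms of the Macdonald function, this equation can be written

$$u^{\mu+\nu} K_\mu(u) K_\nu(u) = \frac{B(\nu,\mu)}{2} \sum_{n=0}^{+\infty} \frac{(-1)^n \beta_n^{(\nu,\mu)}}{n!\, 2^n}\, D^{2n}\left[u^{\mu+\nu+n} K_{\mu+\nu+n}(u)\right].$$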

We have not been able to find this expression in the literature. 

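Still another interpretation of formula (6) for $d = 1$ is to rewrite it as

$$\frac{f_Y(x)}{f_{\mu+\nu}(x)} = \sum_{n=0}^{+\infty} c_n^{(\nu,\mu)}\, \frac{\left(\mu+\nu+\frac{1}{2}\right)_n}{\left(\frac{1}{2}\right)_n} \left(\frac{x^2}{1+x^2}\right)^n$$

which shows that $f_Y(x)/f_{\mu+\nu}(x)$ has a holomorphic extension to the open set $\left\{z \in \mathbb{C} \;:\; \left|\frac{z^2}{1+z^2}\right| < 1\right\}$, which is the region between the two branches of the equilateral hyperbola $y^2 - x^2 = \frac{1}{2}$.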

Remark 2. In [14], Ruben shows that in the one-dimensional case $d = 1$, the density $f_Z$ of the convolution $Z = T_1 + T_2$ can be written as a mixture of scaled Student t-distributions with $\mu + \nu$ degrees of freedom, i.e. as

+00 



fz{z) = I (^) h{t)dt 



for some probability density $h$ on $]2, +\infty[$. This representation differs from (6), which is a discrete mixture of the densities $\phi_{n,\mu+\nu}$. In the same paper, Ruben obtains several expressions for the density $f_Z$ in the case $\mu = \nu$. Ruben's formula (3.4) coincides with formula (6) in the special case $\mu = \nu$, $d = 1$.

3. Properties of the mixing distributions 

In this section, we derive several properties of the discrete distributions (4) and (7).






3.1. Stochastic interpretation

Since, as a consequence of (4), the coefficients $a_k^{(\nu,\gamma)}$ are positive, and since they sum to 1 (see Theorem 7), a stochastic interpretation of Theorem 1 is as follows:

$$Z = X + T \overset{d}{=} G_K$$

where, given $K = k$, $G_k$ has the conditional density (1) and $K$ is a discrete random variable with

$$\Pr\{K = k\} = a_k^{(\nu,\gamma)}, \quad k = 0, 1, \ldots \tag{10}$$

We deduce the following

Proposition 4. The sum $Z = X + T$ is distributed as $\sigma \|X_{2K+d}\| U_d$ where

• vector $U_d$ is uniformly distributed on the unit sphere $S_d$ in $\mathbb{R}^d$,

• vector $X_{2k+d} \in \mathbb{R}^{2k+d}$ is Gaussian with identity covariance matrix,

• random variable $K$ defined by (10) follows a compound negative binomial distribution $NB\left(\frac{d}{2}, \frac{a}{1+a}\right)$, where the parameter $a$ is Gamma distributed with scale parameter $\gamma^2$ and shape parameter $\nu$ (in short $a \sim \Gamma(\nu, \gamma^2)$), so $a$ has the density

$$\frac{\gamma^{2\nu}}{\Gamma(\nu)}\, e^{-\gamma^2 t}\, t^{\nu-1}, \quad t > 0.$$
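The representation in Proposition 4 lends itself to a quick Monte Carlo sanity check. The sketch below compares the mean of $\|Z\|^2$ obtained from the representation $\sigma\|X_{2K+d}\|U_d$ with the one obtained from the direct construction $Z = \sigma N_1 + N_2/\sqrt{2a}$, $a \sim \Gamma(\nu, 1)$, used in the proof of Theorem 1. It assumes $\gamma = 1/(\sigma\sqrt{2})$ (consistent with the substitution rule of Proposition 2), a Gamma density proportional to $e^{-\gamma^2 t} t^{\nu-1}$ for the mixing parameter, and the conditional law $\Pr\{K=k \mid a\} \propto (a/(1+a))^{d/2} (1+a)^{-k}$; the helper names are hypothetical.

```python
import math
import random

def sample_negative_binomial(r, p, rng, k_cap=10**6):
    """Inverse-CDF draw with Pr{K = k} = C(k + r - 1, k) * p**r * (1 - p)**k."""
    u = rng.random()
    k = 0
    term = p ** r
    cdf = term
    while cdf < u and k < k_cap:
        term *= (k + r) / (k + 1) * (1.0 - p)  # ratio of consecutive probabilities
        k += 1
        cdf += term
    return k

def squared_norm_via_representation(nu, sigma, d, rng):
    """||Z||^2 drawn as sigma^2 * ||X_{2K+d}||^2 (assumed reading of Proposition 4)."""
    gamma_sq = 1.0 / (2.0 * sigma ** 2)            # assumption: gamma = 1/(sigma*sqrt(2))
    a = rng.gammavariate(nu, 1.0 / gamma_sq)       # density ~ exp(-gamma^2 t) t^(nu-1)
    k = sample_negative_binomial(d / 2.0, a / (1.0 + a), rng)
    chi_square = rng.gammavariate(k + d / 2.0, 2.0)  # chi-square with 2k + d degrees of freedom
    return sigma ** 2 * chi_square

def squared_norm_direct(nu, sigma, d, rng):
    """||Z||^2 with Z = sigma*N1 + N2/sqrt(2a), a ~ Gamma(nu, 1)."""
    a = rng.gammavariate(nu, 1.0)
    scale = 1.0 / math.sqrt(2.0 * a)
    return sum((sigma * rng.gauss(0.0, 1.0) + scale * rng.gauss(0.0, 1.0)) ** 2
               for _ in range(d))
```

For $\nu > 1$ both sample means should approach $d\left(\sigma^2 + \frac{1}{2(\nu-1)}\right)$, since $T$ has covariance matrix $\frac{1}{2(\nu-1)} I_d$. Only the squared norm is simulated here because the direction $U_d$ is uniform in both constructions.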

An analogous result in the case of the sum of two Student t-vectors is as follows: $Y = T_1 + T_2$ is distributed as a random variable $\Phi_N$ such that, given $N = n$, $\Phi_n$ has the conditional density given by (2) and $N$ is a discrete random variable defined by

$$\Pr\{N = n\} = c_n^{(\nu,\mu)}, \quad n = 0, 1, \ldots \tag{11}$$

We deduce the following

Proposition 5. The sum $Y = T_1 + T_2$ is distributed as $\|T_{2N+d}\| U_d$ where

• vector $U_d$ is uniformly distributed on the sphere $S_d$ in $\mathbb{R}^d$,

• random vector $T_{2n+d} \in \mathbb{R}^{2n+d}$ is Student t-distributed with $2\nu + 2\mu$ degrees of freedom,

• random variable $N$ defined by (11) follows a compound negative binomial distribution $NB\left(\frac{d}{2}, t(1-t)\right)$, where the parameter $t$ follows a Beta distribution with parameters $\mu$ and $\nu$.

3.2. Hausdorff moment sequence

A sequence $\{s_n\}$ is called a Hausdorff moment sequence if it has the form

$$s_n = \int_0^1 x^n\, d\mu(x), \quad n = 0, 1, \ldots$$

for some positive measure $\mu$. It is called normalized if $s_0 = 1$. The following result holds:

Theorem 6. Both sequences $\left(a_k^{(\nu,\gamma)}\right)$ and $\left(c_n^{(\nu,\mu)}\right)$ are Hausdorff moment sequences for $d = 1$ and $d = 2$.
3.3. Mean and Variance

The first and second order moments of these distributions can be easily computed as a consequence of the fact that they are compound distributions.

Theorem 7. The distribution (10) of $K$ verifies

$$\sum_{k=0}^{+\infty} a_k^{(\nu,\gamma)} = 1, \qquad EK = \frac{d\gamma^2}{2(\nu-1)}, \quad \nu > 1,$$

$$\sigma_K^2 = \frac{d\gamma^2}{2(\nu-1)} \left(1 + \frac{\gamma^2}{2}\, \frac{d + 2(\nu-1)}{(\nu-1)(\nu-2)}\right), \quad \nu > 2.$$
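The moments of $K$ can be checked numerically from the compound structure. The sketch below is an illustration only: it assumes that, given $a$, $\Pr\{K = k \mid a\} = \binom{k+d/2-1}{k} \left(\frac{a}{1+a}\right)^{d/2} (1+a)^{-k}$, that $a$ has density $\gamma^{2\nu} e^{-\gamma^2 t} t^{\nu-1}/\Gamma(\nu)$, and it integrates over $a$ with a midpoint rule on a truncated range. The function name and the truncation heuristic are not from the paper.

```python
import math

def gaussian_student_weights(nu, gamma, d, k_max, n_grid=4000):
    """Approximate Pr{K = k}, k = 0..k_max, by mixing NB(d/2, a/(1+a)) over a."""
    r = d / 2.0
    rate = gamma ** 2
    a_max = (nu + 12.0 * math.sqrt(nu) + 20.0) / rate  # ad hoc truncation of the Gamma tail
    h = a_max / n_grid
    log_norm = nu * math.log(rate) - math.lgamma(nu)
    weights = [0.0] * (k_max + 1)
    for i in range(n_grid):
        a = (i + 0.5) * h
        density = math.exp(log_norm - rate * a + (nu - 1.0) * math.log(a))
        p = a / (1.0 + a)
        term = p ** r                                   # Pr{K = 0 | a}
        for k in range(k_max + 1):
            weights[k] += term * density * h
            term *= (k + r) / (k + 1) * (1.0 - p)       # Pr{K = k+1 | a} / Pr{K = k | a}
    return weights

def mean_and_variance(weights):
    """Mean and variance of the (truncated) distribution given by the weights."""
    mean = sum(k * w for k, w in enumerate(weights))
    second = sum(k * k * w for k, w in enumerate(weights))
    return mean, second - mean * mean
```

For instance, with `nu=4, gamma=1, d=1` the computed mean and variance can be compared with $\frac{d\gamma^2}{2(\nu-1)}$ and $\frac{d\gamma^2}{2(\nu-1)}\left(1+\frac{\gamma^2}{2}\frac{d+2(\nu-1)}{(\nu-1)(\nu-2)}\right)$. The tail of $K$ decays only polynomially, so `k_max` has to be fairly large for the second moment.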



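In the case of the sum of Student t-variables, we have

Theorem 8. The distribution (11) of $N$ verifies

$$\sum_{n=0}^{+\infty} c_n^{(\nu,\mu)} = 1, \qquad EN = \frac{d}{2} \left(\frac{B(\nu-1,\mu-1)}{B(\nu,\mu)} - 1\right), \quad \nu, \mu > 1,$$

$$\sigma_N^2 = \frac{d}{2}\left(1+\frac{d}{2}\right) \frac{B(\nu-2,\mu-2)}{B(\nu,\mu)} - \frac{d}{2}\, \frac{B(\nu-1,\mu-1)}{B(\nu,\mu)} - \frac{d^2}{4}\, \frac{B^2(\nu-1,\mu-1)}{B^2(\nu,\mu)}, \quad \nu, \mu > 2.$$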
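The coefficients $c_n^{(\nu,\mu)}$ can be approximated in the same spirit. The sketch below assumes the mixture form $\Pr\{N = n \mid t\} = \binom{n+d/2-1}{n} (t(1-t))^{d/2} (1-t(1-t))^{n}$ with $t$ Beta distributed with parameters $\mu$ and $\nu$, which after integration gives the integral formula (7). The midpoint grid and the function name are illustrative choices.

```python
import math

def student_sum_weights(nu, mu, d, n_max, n_grid=2000):
    """Approximate c_n, n = 0..n_max, by mixing NB(d/2, t(1-t)) over t ~ Beta(mu, nu)."""
    r = d / 2.0
    log_beta = math.lgamma(nu) + math.lgamma(mu) - math.lgamma(nu + mu)
    h = 1.0 / n_grid
    weights = [0.0] * (n_max + 1)
    for i in range(n_grid):
        t = (i + 0.5) * h
        # Beta density t^(mu-1) (1-t)^(nu-1) / B(nu, mu) at the midpoint.
        density = math.exp((mu - 1.0) * math.log(t) + (nu - 1.0) * math.log(1.0 - t) - log_beta)
        p = t * (1.0 - t)
        term = p ** r                                # Pr{N = 0 | t}
        for n in range(n_max + 1):
            weights[n] += term * density * h
            term *= (n + r) / (n + 1) * (1.0 - p)    # factor (1 - t + t^2) per step
    return weights
```

With `nu=3, mu=4, d=1` the weights should sum to approximately 1, be unchanged when `nu` and `mu` are swapped, and have a mean close to $\frac{d}{2}\left(\frac{B(\nu-1,\mu-1)}{B(\nu,\mu)}-1\right) = 2$.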



3.4. Asymptotic behaviors

3.4.1. Fixed value of $\gamma$

Let us now study the asymptotic behavior of the distribution (10) for large values of the degrees of freedom $\nu$: note that the next two theorems can be checked directly by looking at the stochastic representation of $K$ as given in Proposition 4, but their proofs are instructive by themselves.

Theorem 9. For a fixed value of the parameter $\gamma$,

$$\lim_{\nu \to +\infty} a_k^{(\nu,\gamma)} = \delta_k = \begin{cases} 1, & k = 0, \\ 0, & k \ge 1. \end{cases}$$

We note that this result is a consequence of the fact that, since $T$ has covariance matrix $\frac{1}{2(\nu-1)} I_d$, the random variable $Z = X + T$ has the same distribution as $X$ as $\nu \to +\infty$.

3.4.2. Covariance matrix of $T$ fixed

Let us now look at the asymptotic behavior of these coefficients for large values of $\nu$ but for a fixed value of the covariance matrix of $T$: since $T$ has covariance matrix $\frac{1}{2(\nu-1)} I_d$, let us consider

$$Z = X + \eta\sqrt{2(\nu-1)}\, T, \tag{12}$$

where $X$ has covariance matrix $\sigma^2 I_d$ and $\eta\sqrt{2(\nu-1)}\, T$ has covariance matrix $\eta^2 I_d$. Define

$$\gamma = \frac{\eta\sqrt{\nu-1}}{\sigma}$$

so that

$$f_Z(z) = \sum_{k=0}^{+\infty} a_k^{(\nu,\gamma)}\, g_{k,\sigma}(z). \tag{13}$$

Theorem 10. The coefficients $a_k^{(\nu,\gamma)}$ in (13) behave for large $\nu$ as

$$\frac{\Gamma\left(k+\frac{d}{2}\right)}{k!\,\Gamma\left(\frac{d}{2}\right)} \left(\frac{\eta^2}{\sigma^2}\right)^{k} \left(1+\frac{\eta^2}{\sigma^2}\right)^{-k-\frac{d}{2}}.$$

As a consequence, the density of $Z = X + \eta\sqrt{2(\nu-1)}\, T$ converges to a Gaussian density with covariance matrix $(\sigma^2 + \eta^2) I_d$, as expected.

Fig. 1 shows the successive terms of the approximation of the density of $Z$ in the one-dimensional case ($d = 1$): the upper curve is the density $f_Z$, the second one is the approximation of $f_Z$ by the sum of the first four terms $0 \le k \le 3$ in (3), and the four lower curves are the successive terms of this sum for $k = 0, 1, 2$ and $3$. The first ($k = 0$) term (Gaussian density) fits $f_Z$ around $z = 0$, but the tail is underestimated; the other terms contribute to correct this underestimation, each adding, as $k$ increases, a smaller contribution farther from the origin.

Figure 1: from top to bottom, density $f_Z$ in the case $d = 1$, its approximation by the sum of the first four terms in (3), and these first four terms



4. Proofs 

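4.1. Proof of Theorem 1

Using a result by Keilson and Steutel [?], a stochastic representation for the random variable $Z$ is

$$Z \overset{d}{=} \sigma N_1 + \frac{N_2}{\sqrt{2a}} \overset{d}{=} \sqrt{\sigma^2 + \frac{1}{2a}}\; N$$

where $N$ is $d$-variate standard normal (as are the independent vectors $N_1$ and $N_2$), $a$ is Gamma distributed $\Gamma(\nu, 1)$ and $\overset{d}{=}$ denotes equality in distribution. The density of $Z$ thus reads

$$f_Z(z) = \int_0^{+\infty} \frac{e^{-a} a^{\nu-1}}{\Gamma(\nu)}\, \frac{1}{\left(2\pi\left(\sigma^2+\frac{1}{2a}\right)\right)^{d/2}} \exp\left(-\frac{\|z\|^2}{2\left(\sigma^2+\frac{1}{2a}\right)}\right) da. \tag{14}$$

Since $\exp\left(-\frac{\|z\|^2}{2\left(\sigma^2+\frac{1}{2a}\right)}\right) = \exp\left(-\frac{\|z\|^2}{2\sigma^2}\right) \exp\left(\frac{\|z\|^2}{2\sigma^2}\, \frac{\gamma^2}{a+\gamma^2}\right)$ with $\gamma^2 = \frac{1}{2\sigma^2}$, we deduce

$$f_Z(z) = \frac{1}{(\sigma\sqrt{2\pi})^{d}} \exp\left(-\frac{\|z\|^2}{2\sigma^2}\right) \frac{1}{\Gamma(\nu)} \sum_{k=0}^{+\infty} \frac{1}{k!} \left(\frac{\|z\|^2}{2\sigma^2}\right)^{k} \gamma^{2k} \int_0^{+\infty} e^{-a}\, a^{\nu+\frac{d}{2}-1} \left(a+\gamma^2\right)^{-k-\frac{d}{2}} da
$$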
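where the interchange of the sum and the integral is justified by the Lebesgue monotone convergence theorem. Since, using [?, 4.642], $\forall k \in \mathbb{N}$,

$$\int_{\mathbb{R}^d} \left(\frac{\|z\|^2}{2\sigma^2}\right)^{k} \exp\left(-\frac{\|z\|^2}{2\sigma^2}\right) dz = \frac{2\pi^{d/2}}{\Gamma\left(\frac{d}{2}\right)} \int_0^{+\infty} \left(\frac{r^2}{2\sigma^2}\right)^{k} e^{-\frac{r^2}{2\sigma^2}}\, r^{d-1}\, dr$$

which is equal to $(\sigma\sqrt{2})^{d}\, \dfrac{\pi^{d/2}}{\Gamma\left(\frac{d}{2}\right)}\, \Gamma\left(k+\frac{d}{2}\right)$, the function $g_{k,\sigma}$ as defined by (1) is a probability density and the result follows. Using the Kummer function

$$\Psi(\alpha, \beta; z) = \frac{1}{\Gamma(\alpha)} \int_0^{+\infty} e^{-zt}\, t^{\alpha-1} (1+t)^{\beta-\alpha-1}\, dt, \tag{15}$$

cf. [10, p. 1085], the integral in the expression of $f_Z$ can be written

$$\int_0^{+\infty} e^{-a}\, a^{\nu+\frac{d}{2}-1} \left(a+\gamma^2\right)^{-k-\frac{d}{2}} da = \gamma^{2\nu-2k}\, \Gamma\left(\nu+\tfrac{d}{2}\right) \Psi\left(\nu+\tfrac{d}{2},\, \nu-k+1;\, \gamma^2\right).$$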

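4.2. Proof of Proposition 2

We have

$$Z = a_2\left(\frac{a_1}{a_2} X + T\right) = a_2\left(\tilde{\sigma} N_1 + T\right) = a_2 \tilde{Z}$$

with $\tilde{\sigma} = \frac{a_1}{a_2}\sigma$ so that $\tilde{\gamma} = \frac{1}{\tilde{\sigma}\sqrt{2}} = \frac{a_2}{\sigma\sqrt{2}\,a_1}$, and

$$f_Z(z) = a_2^{-d} f_{\tilde{Z}}\left(\frac{z}{a_2}\right) = \sum_{k=0}^{+\infty} a_k^{(\nu,\tilde{\gamma})}\, a_2^{-d}\, g_{k,\frac{a_1}{a_2}\sigma}\left(\frac{z}{a_2}\right),$$

but $a_2^{-d}\, g_{k,\frac{a_1}{a_2}\sigma}\left(\frac{z}{a_2}\right) = g_{k,a_1\sigma}(z)$ so that the result holds.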
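4.3. Proof of Theorem 3

Let us define $t = \frac{\sigma^2}{2}$ so that $\gamma = \frac{1}{\sigma\sqrt{2}} = \frac{1}{2\sqrt{t}}$ and consider the $k$-th term in the convex combination (3):

$$a_k^{(\nu,\gamma)} g_{k,\sigma}(x) = \frac{1}{\pi^{d/2}\,\Gamma(\nu)\,k!} \left(\frac{\|x\|^2}{4t}\right)^{k} \exp\left(-\frac{\|x\|^2}{4t}\right) \int_0^{+\infty} \frac{e^{-s} s^{\nu+\frac{d}{2}-1}}{(1+4ts)^{k+\frac{d}{2}}}\, ds;$$

putting $w = 4ts$ and subordinating with the inverse Gamma distribution $\gamma_\mu$ yields

$$\int_0^{+\infty} a_k^{(\nu,\gamma)} g_{k,\sigma}(x)\, d\gamma_\mu(t) = \frac{\|x\|^{2k}}{\pi^{d/2}\,\Gamma(\mu)\Gamma(\nu)\,k!} \int_0^{+\infty}\!\!\int_0^{+\infty} \frac{w^{\nu+\frac{d}{2}-1}}{(1+w)^{k+\frac{d}{2}}}\, \frac{4^{-\mu}\, t^{-\mu-1}}{(4t)^{k+\nu+\frac{d}{2}}} \exp\left(-\frac{w+\|x\|^2+1}{4t}\right) dt\, dw.$$

By the change of variable $u = \frac{w+\|x\|^2+1}{4t}$ this is equal to

$$\frac{\|x\|^{2k}}{\pi^{d/2}\,\Gamma(\mu)\Gamma(\nu)\,k!} \int_0^{+\infty} \frac{w^{\nu+\frac{d}{2}-1}}{(1+w)^{k+\frac{d}{2}} \left(w+\|x\|^2+1\right)^{k+\mu+\nu+\frac{d}{2}}} \left(\int_0^{+\infty} e^{-u} u^{k+\mu+\nu+\frac{d}{2}-1}\, du\right) dw$$

and, since the latter integral is a Gamma integral, to

$$\frac{\Gamma\left(k+\mu+\nu+\frac{d}{2}\right) \|x\|^{2k}}{\pi^{d/2}\,\Gamma(\mu)\Gamma(\nu)\,k!} \int_0^{+\infty} \frac{w^{\nu+\frac{d}{2}-1}}{(1+w)^{k+\frac{d}{2}} \left(w+\|x\|^2+1\right)^{k+\mu+\nu+\frac{d}{2}}}\, dw.$$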
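The latter integral can be identified as Euler's integral representation of a hypergeometric function [?, Th. 2.2.1 p. 65]:

$$\int_0^{+\infty} \frac{w^{\nu+\frac{d}{2}-1}\, dw}{(1+w)^{k+\frac{d}{2}} \left(w+\|x\|^2+1\right)^{k+\mu+\nu+\frac{d}{2}}} = \frac{\Gamma\left(2k+\mu+\frac{d}{2}\right)\Gamma\left(\nu+\frac{d}{2}\right)}{\Gamma(2k+\mu+\nu+d)}\; {}_2F_1\left(\begin{matrix} k+\mu+\nu+\frac{d}{2},\; 2k+\mu+\frac{d}{2} \\ 2k+\mu+\nu+d \end{matrix};\, -\|x\|^2\right)$$

so that, using Pfaff's formula [?, 2.2.6 p. 68]

$${}_2F_1\left(\begin{matrix} a,\; b \\ c \end{matrix};\, z\right) = (1-z)^{-a}\; {}_2F_1\left(\begin{matrix} a,\; c-b \\ c \end{matrix};\, \frac{z}{z-1}\right)$$

we deduce

$$\int_0^{+\infty} a_k^{(\nu,\gamma)} g_{k,\sigma}(x)\, d\gamma_\mu(t) = \frac{\Gamma\left(\mu+\nu+k+\frac{d}{2}\right)\Gamma\left(\nu+\frac{d}{2}\right)}{\pi^{d/2}\,\Gamma(\mu)\Gamma(\nu)\,k!}\, \frac{\Gamma\left(2k+\mu+\frac{d}{2}\right)}{\Gamma(2k+\mu+\nu+d)}\, \frac{\|x\|^{2k}}{\left(1+\|x\|^2\right)^{k+\mu+\nu+\frac{d}{2}}}\; {}_2F_1\left(\begin{matrix} k+\mu+\nu+\frac{d}{2},\; \nu+\frac{d}{2} \\ 2k+\mu+\nu+d \end{matrix};\, \frac{\|x\|^2}{1+\|x\|^2}\right).$$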

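Replacing the hypergeometric function by its series, we deduce

$$\int_0^{+\infty} a_k^{(\nu,\gamma)} g_{k,\sigma}(x)\, d\gamma_\mu(t) = \frac{\Gamma\left(\mu+\nu+k+\frac{d}{2}\right)\Gamma\left(\nu+\frac{d}{2}\right)}{\pi^{d/2}\,\Gamma(\mu)\Gamma(\nu)\,k!}\, \frac{\Gamma\left(2k+\mu+\frac{d}{2}\right)}{\Gamma(2k+\mu+\nu+d)} \sum_{l=0}^{+\infty} \frac{\left(\nu+\frac{d}{2}\right)_l \left(k+\mu+\nu+\frac{d}{2}\right)_l}{(2k+\mu+\nu+d)_l\; l!}\, \frac{\|x\|^{2k+2l}}{\left(1+\|x\|^2\right)^{k+l+\mu+\nu+\frac{d}{2}}} = \sum_{l=0}^{+\infty} d_{k,l}\, \phi_{l+k,\eta}(x)$$

where $\eta = \mu + \nu$, the density $\phi_{n,\eta}(x)$ is defined by (2) and the coefficient $d_{k,l}$ is equal to

$$d_{k,l} = \frac{\Gamma\left(\mu+\nu+k+\frac{d}{2}\right)\Gamma\left(\nu+\frac{d}{2}\right)}{\pi^{d/2}\,\Gamma(\mu)\Gamma(\nu)\,k!}\, \frac{\Gamma\left(2k+\mu+\frac{d}{2}\right)}{\Gamma(2k+\mu+\nu+d)}\, \frac{\left(\nu+\frac{d}{2}\right)_l \left(k+\mu+\nu+\frac{d}{2}\right)_l}{(2k+\mu+\nu+d)_l\; l!}\, \frac{\Gamma(\mu+\nu)\,\Gamma\left(k+l+\frac{d}{2}\right)}{\Gamma\left(\mu+\nu+k+l+\frac{d}{2}\right)}\, \frac{\pi^{d/2}}{\Gamma\left(\frac{d}{2}\right)}$$

$$= \frac{\Gamma(\mu+\nu) \left(\nu+\frac{d}{2}\right)_l\, \Gamma\left(\nu+\frac{d}{2}\right)\Gamma\left(2k+\mu+\frac{d}{2}\right)\Gamma\left(k+l+\frac{d}{2}\right)}{\Gamma(\mu)\Gamma(\nu)\; k!\, l!\; \Gamma(2k+l+\mu+\nu+d)\,\Gamma\left(\frac{d}{2}\right)},$$

using the identity $(a)_l\, \Gamma(a) = \Gamma(a+l)$ twice. Writing $\Gamma\left(2k+\mu+\frac{d}{2}\right) = \left(\mu+\frac{d}{2}\right)_{2k} \Gamma\left(\mu+\frac{d}{2}\right)$, $\Gamma\left(k+l+\frac{d}{2}\right) = \left(\frac{d}{2}\right)_{k+l} \Gamma\left(\frac{d}{2}\right)$ and $\Gamma(2k+l+\mu+\nu+d) = (\mu+\nu+d)_{2k+l}\, \Gamma(\mu+\nu+d)$, we deduce

$$d_{k,l} = \frac{\Gamma(\mu+\nu)\,\Gamma\left(\mu+\frac{d}{2}\right)\Gamma\left(\nu+\frac{d}{2}\right)}{\Gamma(\mu)\Gamma(\nu)\,\Gamma(\mu+\nu+d)}\; \frac{\left(\nu+\frac{d}{2}\right)_l \left(\mu+\frac{d}{2}\right)_{2k} \left(\frac{d}{2}\right)_{k+l}}{k!\; l!\; (\mu+\nu+d)_{2k+l}}.$$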
Now the density of $Y = T_1 + T_2$ is the sum over $k$ of these terms and reads

$$f_Y(x) = \sum_{k,l=0}^{+\infty} d_{k,l}\, \phi_{k+l,\eta}(x).$$

We perform the change of variable $n = k + l$ to obtain

$$f_Y(x) = \sum_{n=0}^{+\infty} \phi_{n,\eta}(x) \sum_{k=0}^{n} d_{k,n-k}.$$
The inner sum can be written
where we have used the identity 

1 ^{^I + ly + d + n + k)^_f^ 

By the Chu-Vandermonde identity [?, p. 70]

and using the identities (p + f (m + j + 2<:),._i,_, = {(' + t)„+i,_, ond + f )_ + | + n - t)^, = + 
we deduce 

1— A; 



(...) ^ 1,^ + 1) (f)„ " -Y^A^-fcU, I d\ / ,f 

n-\-k—j V ^ / n— fc+j 



where we have used the change of summation index p ~ k — j and where the trinomial (3| coefficient (p) ^ is 

n\ fn\ fn — k\ 

j-\-k<n \ k— j = p 

Writing {fi + D^^^p r + I) = r + n + p + I) and (i^ + T (z. + |) = T (j. + | + n - p) and replacing 

the resulting Beta function by its integral representation we obtain 

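and since the generating function of the trinomial coefficients is

$$\sum_{p=-n}^{n} \binom{n}{p}_{2} x^{p} = \left(x + \frac{1}{x} + 1\right)^{n}$$

we finally obtain

$$c_n^{(\nu,\mu)} = \frac{1}{B(\nu,\mu)}\, \frac{\left(\frac{d}{2}\right)_n}{n!} \int_0^1 t^{\mu+n+\frac{d}{2}-1} (1-t)^{\nu+n+\frac{d}{2}-1} \left(\frac{t}{1-t} + \frac{1-t}{t} + 1\right)^{n} dt,$$

which is the desired result, since $t(1-t)\left(\frac{t}{1-t} + \frac{1-t}{t} + 1\right) = 1 - t + t^2$.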
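4.4. Proof of Theorem 6

In the case of the coefficients $a_k^{(\nu,\gamma)}$, we can factorize

$$a_k^{(\nu,\gamma)} = \frac{\left(\frac{d}{2}\right)_k}{k!}\, \tau_k$$

where

$$\tau_k = \frac{\gamma^{2k}}{\Gamma(\nu)} \int_0^{+\infty} \frac{e^{-s} s^{\nu+\frac{d}{2}-1}}{\left(s+\gamma^2\right)^{k+\frac{d}{2}}}\, ds = \frac{\gamma^{2\nu}}{\Gamma(\nu)} \int_0^1 u^{k}\, u^{-\nu-1} (1-u)^{\nu+\frac{d}{2}-1} \exp\left(-\gamma^2\, \frac{1-u}{u}\right) du,$$

the second expression following from the change of variable $u = \frac{\gamma^2}{s+\gamma^2}$. This expression shows that the sequence $\{\tau_k\}$ is also a Hausdorff moment sequence (not normalized) and hence $a_k^{(\nu,\gamma)}$ is a Hausdorff moment sequence for $d = 1, 2$ as product of two such sequences. It is not normalized since

$$\sum_{k=0}^{+\infty} a_k^{(\nu,\gamma)} = 1.$$

In fact

$$\frac{(1)_k}{k!} = 1$$

is trivially a Hausdorff moment sequence when $d = 2$, and for $d = 1$ we have

$$\frac{\left(\frac{1}{2}\right)_k}{k!} = \frac{1}{\pi} \int_0^1 s^{k}\, \frac{ds}{\sqrt{s(1-s)}}.$$

It is a general fact that $s_k = \frac{(a)_k}{(b)_k}$ is a Hausdorff moment sequence when $0 < a < b$, see [?, formula (26)], and if $a > b$ then $s_k$ is not even a Hamburger moment sequence since the Hankel determinant

$$D_2 = s_0 s_2 - s_1^2 = \frac{a(b-a)}{b^2(b+1)} < 0.$$

This shows that the density (3) is an infinite convex combination of the densities (1) and the discrete probability $a^{(\nu,\gamma)} = \sum_{k \ge 0} a_k^{(\nu,\gamma)} \delta_k$ is infinitely divisible for $d = 1, 2$ since the sequence $\left(a_k^{(\nu,\gamma)}\right)$ is completely monotonic, which is the same as being a Hausdorff moment sequence.

In the case of the sequence $\left(c_n^{(\nu,\mu)}\right)$, we find

$$c_n^{(\nu,\mu)} = \frac{\left(\frac{d}{2}\right)_n}{n!}\, \frac{1}{B(\nu,\mu)} \int_0^1 \left(1-t+t^2\right)^{n} t^{\mu+\frac{d}{2}-1} (1-t)^{\nu+\frac{d}{2}-1}\, dt,$$

where the integral is the $n$-th moment of the image of the positive measure $t^{\mu+\frac{d}{2}-1}(1-t)^{\nu+\frac{d}{2}-1}\,dt$ under the map $t \mapsto u = 1-t+t^2$, which takes values in $\left[\frac{3}{4}, 1\right]$. As before, $\left(c_n^{(\nu,\mu)}\right)$ is a Hausdorff moment sequence for $d = 1, 2$.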

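4.5. Proof of Theorem 7

Although the first result can be obtained by integrating (3) over $z \in \mathbb{R}^d$, we check it directly since the computation of the sum is instructive by itself. For any $u \in\, ]-1, 1[$, we have by the binomial series

$$\sum_{k=0}^{+\infty} \frac{\Gamma\left(k+\frac{d}{2}\right)}{k!}\, u^{k} = \frac{\Gamma\left(\frac{d}{2}\right)}{(1-u)^{d/2}} \tag{17}$$

so that, with $u = \frac{\gamma^2}{a+\gamma^2}$,

$$\sum_{k=0}^{+\infty} \frac{\Gamma\left(k+\frac{d}{2}\right)}{k!} \left(\frac{\gamma^2}{a+\gamma^2}\right)^{k} = \Gamma\left(\tfrac{d}{2}\right) \left(\frac{a+\gamma^2}{a}\right)^{d/2}$$

and we deduce

$$\sum_{k=0}^{+\infty} a_k^{(\nu,\gamma)} = \int_0^{+\infty} \frac{e^{-a}\, a^{\nu+\frac{d}{2}-1}}{\Gamma(\nu)\,\Gamma\left(\frac{d}{2}\right) \left(a+\gamma^2\right)^{d/2}}\; \Gamma\left(\tfrac{d}{2}\right) \left(\frac{a+\gamma^2}{a}\right)^{d/2} da = \frac{1}{\Gamma(\nu)} \int_0^{+\infty} e^{-a} a^{\nu-1}\, da = 1.$$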
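The expectation and variance are easily computed from the stochastic representation of $K \sim NB\left(\frac{d}{2}, \frac{a}{1+a}\right)$; the mean of a negative binomial variable $NB(r,p)$ being $\frac{r(1-p)}{p}$,

$$EK = \frac{d}{2}\, E\,\frac{1-p}{p} = \frac{d}{2}\, E\,\frac{1}{a} = \frac{d\gamma^2}{2\nu-2}$$

since $a$ is $\Gamma(\nu, \gamma^2)$. Since moreover the second order moment of a negative binomial is $\frac{r(1-p)\left(1+r(1-p)\right)}{p^2}$, a straightforward computation gives

$$EK^2 = \frac{d}{2}\left[E\,\frac{1+a}{a^2} + \frac{d}{2}\, E\,\frac{1}{a^2}\right] = \frac{d}{2}\left[\left(1+\frac{d}{2}\right) E\,\frac{1}{a^2} + E\,\frac{1}{a}\right] = \frac{d}{2}\left(1+\frac{d}{2}\right) \frac{\gamma^4}{(\nu-1)(\nu-2)} + \frac{d\gamma^2}{2(\nu-1)}$$

and

$$\sigma_K^2 = \frac{d\gamma^2}{2(\nu-1)} \left(1 + \frac{\gamma^2}{2}\, \frac{d+2(\nu-1)}{(\nu-1)(\nu-2)}\right).$$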

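4.6. Proof of Theorem 8

The fact that the coefficients $c_n^{(\nu,\mu)}$ sum to 1 is proved in the same way as in the proof of Theorem 7; the expectation is computed accordingly as

$$EN = \frac{d}{2}\, E\left(\frac{1}{t(1-t)} - 1\right)$$

with

$$E\,\frac{1}{t(1-t)} = \frac{B(\nu-1,\mu-1)}{B(\nu,\mu)}.$$

The second order moment is similarly computed as

$$EN^2 = \frac{d}{2}\, E\,\frac{\left(1-t(1-t)\right)\left(1+\frac{d}{2}\left(1-t(1-t)\right)\right)}{t^2(1-t)^2} = \frac{d}{2}\left(1+\frac{d}{2}\right) E\,\frac{1}{t^2(1-t)^2} - \frac{d(1+d)}{2}\, E\,\frac{1}{t(1-t)} + \frac{d^2}{4}$$

so that the variance reads

$$\sigma_N^2 = -\frac{d(1+d)}{2}\, \frac{B(\nu-1,\mu-1)}{B(\nu,\mu)} + \frac{d}{2}\left(1+\frac{d}{2}\right) \frac{B(\nu-2,\mu-2)}{B(\nu,\mu)} + \frac{d^2}{4} - \frac{d^2}{4} \left(\frac{B(\nu-1,\mu-1)}{B(\nu,\mu)} - 1\right)^2$$

and the result follows after simplification.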
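4.7. Proof of Theorem 9

It is clearly enough to prove that $\lim_{\nu \to \infty} a_0^{(\nu,\gamma)} = 1$ since $0 < a_k^{(\nu,\gamma)} < 1 - a_0^{(\nu,\gamma)}$ for $k > 0$. We have

$$1 - a_0^{(\nu,\gamma)} = \frac{1}{\Gamma(\nu)} \int_0^{+\infty} e^{-a} a^{\nu-1} \left[1 - \left(\frac{a}{a+\gamma^2}\right)^{d/2}\right] da$$

hence, applying the mean value theorem to $x \mapsto \left(\frac{a}{a+x}\right)^{d/2}$ on $[0, \gamma^2]$,

$$1 - \left(\frac{a}{a+\gamma^2}\right)^{d/2} = \frac{d\gamma^2}{2}\, \frac{a^{d/2}}{(a+\xi)^{\frac{d}{2}+1}} \le \frac{d\gamma^2}{2a}$$

for some $0 < \xi < \gamma^2$. For $\nu > 1$ we therefore get

$$1 - a_0^{(\nu,\gamma)} \le \frac{d\gamma^2}{2\,\Gamma(\nu)} \int_0^{+\infty} e^{-a} a^{\nu-2}\, da = \frac{d\gamma^2}{2(\nu-1)}$$

which tends to 0 for $\nu \to \infty$.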
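4.8. Proof of Theorem 10

The coefficients $a_k^{(\nu,\gamma)}$ read

$$a_k^{(\nu,\gamma)} = \frac{\Gamma\left(k+\frac{d}{2}\right)}{k!\,\Gamma\left(\frac{d}{2}\right)\Gamma(\nu)} \left(\frac{\eta^2}{\sigma^2}\right)^{k} (\nu-1)^{\nu} \int_0^{+\infty} e^{(\nu-1)(-b+\log b)}\; b^{\frac{d}{2}} \left(b+\frac{\eta^2}{\sigma^2}\right)^{-k-\frac{d}{2}} db$$

using the change of variable $a = b(\nu-1)$. Let us apply Laplace's method [16, p. 278]: if $f'(x_0) = 0$ and $f''(x_0) < 0$ then for large $M$,

$$\int g(x)\, e^{M f(x)}\, dx \sim g(x_0)\, e^{M f(x_0)} \sqrt{\frac{2\pi}{M\,|f''(x_0)|}}.$$

Choosing $M = \nu - 1$, $f(x) = -x + \log x$, $g(x) = x^{\frac{d}{2}} \left(x+\frac{\eta^2}{\sigma^2}\right)^{-k-\frac{d}{2}}$ and $x_0 = 1$, an equivalent of the right-hand side integral reads

$$\left(1+\frac{\eta^2}{\sigma^2}\right)^{-k-\frac{d}{2}} e^{-(\nu-1)} \sqrt{\frac{2\pi}{\nu-1}}.$$

Thus an equivalent of the coefficients $a_k^{(\nu,\gamma)}$ reads

$$\frac{\Gamma\left(k+\frac{d}{2}\right)}{k!\,\Gamma\left(\frac{d}{2}\right)}\; \frac{(\nu-1)^{\nu} e^{-(\nu-1)}}{\Gamma(\nu)} \sqrt{\frac{2\pi}{\nu-1}}\; \left(\frac{\eta^2}{\sigma^2}\right)^{k} \left(1+\frac{\eta^2}{\sigma^2}\right)^{-k-\frac{d}{2}}$$

which, by Stirling's formula, is equivalent to

$$\frac{\Gamma\left(k+\frac{d}{2}\right)}{k!\,\Gamma\left(\frac{d}{2}\right)} \left(\frac{\eta^2}{\sigma^2}\right)^{k} \left(1+\frac{\eta^2}{\sigma^2}\right)^{-k-\frac{d}{2}}.$$

As a consequence, for large values of $\nu$ the density $f_Z$ of $Z$ reads

$$f_Z(z) \sim \sum_{k=0}^{+\infty} \frac{\Gamma\left(\frac{d}{2}\right)}{\pi^{d/2}\,\Gamma\left(k+\frac{d}{2}\right)(\sigma\sqrt{2})^{d}} \left(\frac{\|z\|^2}{2\sigma^2}\right)^{k} \exp\left(-\frac{\|z\|^2}{2\sigma^2}\right) \frac{\Gamma\left(k+\frac{d}{2}\right)}{k!\,\Gamma\left(\frac{d}{2}\right)} \left(\frac{\eta^2}{\sigma^2}\right)^{k} \left(1+\frac{\eta^2}{\sigma^2}\right)^{-k-\frac{d}{2}}$$

$$= \frac{1}{(\sigma\sqrt{2\pi})^{d}} \left(1+\frac{\eta^2}{\sigma^2}\right)^{-\frac{d}{2}} \exp\left(-\frac{\|z\|^2}{2\sigma^2}\right) \exp\left(\frac{\|z\|^2\, \eta^2}{2\sigma^2(\sigma^2+\eta^2)}\right) = \frac{1}{(\tilde{\sigma}\sqrt{2\pi})^{d}} \exp\left(-\frac{\|z\|^2}{2\tilde{\sigma}^2}\right)$$

with

$$\tilde{\sigma}^2 = \sigma^2 + \eta^2.$$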

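4.9. Proof of Proposition 4

Conditionally on a given value of $K = k$, the random variable $\sigma\|X_{2k+d}\|U_d$ follows the distribution $g_{k,\sigma}$; let us check that the random variable $K$ follows a $NB\left(\frac{d}{2}, \frac{a}{1+a}\right)$ compound distribution where $a$ is $\Gamma(\nu, \gamma^2)$: conditionally on $a$, such a compound distribution reads

$$P_{k|a} = \binom{k+\frac{d}{2}-1}{k} \left(\frac{a}{1+a}\right)^{\frac{d}{2}} \left(\frac{1}{1+a}\right)^{k}$$

so that

$$\Pr\{K = k\} = \frac{\Gamma\left(k+\frac{d}{2}\right)}{k!\,\Gamma\left(\frac{d}{2}\right)}\, \frac{\gamma^{2\nu}}{\Gamma(\nu)} \int_0^{+\infty} e^{-\gamma^2 a}\, a^{\nu+\frac{d}{2}-1} (1+a)^{-k-\frac{d}{2}}\, da = \frac{\Gamma\left(k+\frac{d}{2}\right)}{k!\,\Gamma\left(\frac{d}{2}\right)\Gamma(\nu)}\, \gamma^{2k} \int_0^{+\infty} e^{-b}\, b^{\nu+\frac{d}{2}-1} \left(b+\gamma^2\right)^{-k-\frac{d}{2}} db$$

by the change of variable $b = \gamma^2 a$; this expression coincides with (4).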

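4.10. Proof of Proposition 5

Conditionally on a value of $N = n$, the random variable $\|T_{2n+d}\|U_d$ follows the distribution $\phi_{n,\nu+\mu}$; we check that the random variable $N$ follows a $NB\left(\frac{d}{2}, t(1-t)\right)$, where $t$ follows a Beta distribution with parameters $\mu$ and $\nu$: conditionally on $t$, this compound distribution reads

$$P_{n|t} = \binom{n+\frac{d}{2}-1}{n} \left(t(1-t)\right)^{\frac{d}{2}} \left(1-t(1-t)\right)^{n}$$

so that

$$\Pr\{N = n\} = \frac{1}{B(\nu,\mu)}\, \frac{\Gamma\left(n+\frac{d}{2}\right)}{n!\,\Gamma\left(\frac{d}{2}\right)} \int_0^1 t^{\mu+\frac{d}{2}-1} (1-t)^{\nu+\frac{d}{2}-1} \left(1-t+t^2\right)^{n} dt$$

which coincides with (7).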